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Abstract. We consider discrete porous medium equations of the form dtpt = Aiy3(pt), where 
A is the generator of a reversible continuous time Markov chain on a finite set X, and ip is an 
increasing function. We show that these equations arise as gradient flows of certain entropy 
functionals with respect to suitable non-local transportation metrics. This may be seen as a 
discrete analogue of the Wasserstein gradient flow structure for porous medium equations in 
R" discovered by Otto. We present a one-dimensional counterexample to geodesic convexity 
and discuss Gromov-Hausdorff convergence to the Wasserstein metric. 



1. Introduction 

Recently it has been shown that discretisations of heat equations and Fokker-Planck equa- 
tions can be formulated as gradient flows of the entropy with respect to a non-local trans- 
portation metric W on the space of probability measures. Results in this direction have been 
obtained independently in various settings, including Fokker-Planck equations on graphs [5], 
reversible Markov chains [11], and systems of reaction-diffusion equations [14]. Related gradi- 
ent flow structures have been found for fractional heat equations [7] and quantum mechanical 
evolution equations [4, 15]. 

The above-mentioned results can be regarded as discrete counterparts to the by now clas- 
sical result of Jordan, Kinderlehrer and Otto [10], who showed that Fokker-Planck equations 
on are gradient flows of the entropy with respect to the L^-Wasserstein metric on the 
space of probability measures. In this continuous setting there are many other interesting 
partial differential equations which admit a formulation as Wasserstein gradient flow. Among 
them, one of the most prominent examples is the porous medium equation, which has been 
identified as the Wasserstein gradient flow of the Renyi entropy in Otto's seminal paper [16]. 

It seems therefore natural to ask whether a similar gradient flow structure exists for discrete 
versions of porous medium equations. In this paper we show that suitable discretisations 
indeed admit a gradient flow structure. The associated metrics turn out to be variations on 
the metric W, which have already been studied in [11]. 

1.1. The discrete setting. In this paper we let be a finite set. We consider a matrix 
Q : X X X ^ M. satisfying Q{x,y) > for all x / y and "^y^^ Q(^^y) ~ ^- assume 
that Q is irreducible, so that basic Markov chain theory implies the existence of a unique 
invariant probability measure vr on X. We assume that vr is reversible, i.e., the detailed 
balance equations 



Q{x, y)TT{x) = Q{y, x)TT{y) (1.1) 
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hold for all x,y E X. For a function ip : X we write 

AV'(x) := ^ Q{x, y)ip{y) = J] y){i;{y) - V(x)) . (1.2) 

y^X yeX 

The operator A is the generator of a continuous time Markov semigroup {Pt)t>o- By the 
reversibility assumption, this semigroup is selfadjoint on L^{X,tt). Moreover, the set ^{X) 
of probability densities with respect to tt, is invariant under the semigroup {Pt)t>o- 

We shall consider a class of non-local transportation metrics on the space of probability 
densities ^(X). The definition of such a metric [11] requires a weight-function 9 : [0,oo) x 
[0, oo) — )■ [0, oo) satisfying appropriate conditions (see Definition 2.1 below). For p G ,'^{X) 
and X, ?/ G we then define p{x, y) = 9{p{x), p{y)). Inspired by the Benamou-Brenier formula 
[2], we define for po,pi G ^{X) the distance yV{po,pi) by 

w{po,pif := inf \\ f Yl Wy)-H^)fpi^,y)Q{x,y)7r{x) dt\ , 

where the infimum runs over sufficiently regular curves p : [0, 1] — )■ ,'^{X) and ip : [0, 1] x A' — )• 
ffi satisfying the discrete continuity equation 

dtPt{x) + Y (My) - Mx))pt{x,y)Q{x,y) = 

y&X 

for all X £ X, with boundary conditions po = po and po = pi. Actually, W is the Riemannian 
distance induced by a Riemannian metric defined on the interior ^*(Af) of ^{X). 
Consider now the relative entropy functional Ti : ^{X) — t- M defined by 

"^(P) := X] /5(x) vr(x) . 

The following result has been shown in [5, 11, 14]: Solutions to the heat equation dtpt = 
Apt are gradient flow trajectories of % with respect to the metric W, provided that is the 
logarithmic mean defined by 

e{r,s) := /^^-"s" da . (1.3) 
Jo 



1.2. A gradient flow structure for discrete porous medium equations. In this paper 
we shall consider non-linear generalisations of the discrete heat equation. For suitable (see 
Assumption 3.1) strictly increasing functions f : [0, oo) — )• M we consider the discrete porous 
medium type equation given by 

dtpt = A^ipt) , (1.4) 

where A denotes the discrete Laplacian defined in (1.2). This equation can be analysed 
using classical Hilbertian gradient flow techniques in a suitable "discrete i^^^-space" (see 
Proposition 3.2 below). 

Here we are interested in more general gradient flow structures in the spirit of the Wasser- 
stein gradient flow structure for the porous medium equation [16]. For this purpose, for 
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suitable (see Assumption 3.3) strictly convex functions / : [0, oo) — )■ M, we consider the 
entropy functional J- : ^(X) — )• M defined by 



We shall prove the following result. 



Hp) ■■= 22 f(p(^)M^) ■ (1-5) 



Theorem 1.1. Let ip and f be as a above, and let W denote the non-local transportation 
metric induced by 

^ ' ' nr)-r{s)- ^'-'^ 

Then the gradient flow equation of the entropy functional J- with respect to W is the discrete 
porous medium equation dtp = Aip{p). 

Remark 1.2. In the special case where ^p{r) = r and /(r) = rlogr we recover the result from 
[5, 11, 14]. In this case we have 6{r,s) = i^gr-iogs ^ which coincides with (1.3). The possibility 
of allowing more general functions / has already been discussed in [11]. 

Remark 1.3. Of particular interest is the case where if{r) = r"^ and /(r) = for 
some < m < 2. In this case the equation can be considered as a discrete porous medium 
equation (if m > 1) or fast diffusion equation (if m < 1). The functional J-' becomes a Renyi 
entropy J^m, so that Theorem 1.1 can be considered as a discrete analogue of the gradient 
flow structure obtained by Otto [16]. The expression for 6 from (1.6) becomes 

m - 1 r"^ - ,^ ^, 

Several classical means are special cases of this expression. In fact, 02{r, s) = is the arith- 
metic mean, lim.m^iOmi'r'js) is the logarithmic mean, and ^i/2('")S) = is the geometric 
mean. Moreover, 0_i(r, s) = is the harmonic mean, but we shall not consider negative 
values of m in the sequel. 

Remark 1.4. For integer values of m, the porous medium equation takes the form of a chemical 
reaction equation, for which a gradient flow structure has been found in [14]. However, there 
the driving functional is the relative entropy H, so that the associated weight function can be 
expressed in terms of logarithmic means. Here we also allow for different driving functionals, 
such as the Renyi entropy. 

1.3. Geodesic convexity of entropy functionals. Of crucial importance in the theory 
of Wasserstein gradient flows is that the relevant entropy functionals exhibit good convexity 
properties along Wasserstein geodesies. Recent work [8, 13] shows that analogous properties 
hold for the relative entropy functional Ti in some discrete examples. In particular, T-L turns 
out to be convex along W-geodesics in one-dimensional discrete Fokker-Planck equations [13] 
as well as in heat equations on d-dimensional square lattices in arbitrary dimension [8]. 

It thus seems natural to ask whether similar convexity properties hold for the more gen- 
eral functionals along W-geodesics with the appropriate choice of 9 given by (1.6). In 
the continous setting, it follows from a fundamental result by McCann [12] that the Renyi 
entropy J^m{p) = j^n /0™(x) dx is displacement convex, i.e., convex along L^-Wasserstein 
geodesies in ^(M*^), for m > 1 — ^. 
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However, in this paper we shall show that the discrete analogue fails in general. We present 
a counterexample (Proposition 4.5) in the case m = 2, which shows that even in one dimension 
the Renyi entropy fails to be convex along geodesies in the associated non-local transportation 
metric. 

Proposition 1.5. For N > 6, let Q be the generator of simple random walk on the dis- 
crete circle T^v = Z/NZ. Let W be the non-local transportation metric associated with the 
arithmetic mean 02- Then the Renyi entropy T2 is not convex along W -geodesies. 

It remains an interesting open question whether the Renyi entropy J-m is geodesically 
convex in some non-trivial discrete examples for values of m ^ 1. 

1.4. Gromov-HausdorfF convergence of discrete transportation metrics. Since the 
discrete transportation metrics discussed in this paper take over the role of the L^-Wasserstein 
metric in a discrete setting, it seems natural to ask whether they converge to the L^- 
Wasserstein metric by a suitable limiting procedure. First results in this spirit have been 
obtained in [9], where it was proved that discrete transportation metrics Wtv associated with 
simple random walk on the discrete torus {'L/N'L)'^ converge to the L^-Wasser stein metric 
W2 over the torus T*^. The weight function considered in [9] is the logarithmic mean. In the 
present paper we observe that the result extends to more general weight functions associated 
to porous medium equations. 

Theorem 1.6. Let d > 1, let < m < 2, and let Wn be the renormalised discrete trans- 
portation metric on 'P(T^) associated with simple random walk on and weight function 
6m- Then the metric spaces ('P(T^), Wat) converge to (^(T'^), W2) ii^ the sense of Gromov- 
Hausdorff as N ^ 00. 

Since this result can be obtained by a minor modification of the proof in [9], we do not give 
a detailed proof here, but merely point out the crucial properties which make the argument 
go through. 



In this section we collect some preliminary results on non-local transportation metrics. The 
presentation here is based on [11]. 

In this paper, a reversible Markov chain consists of a triple (Af , Q, vr) where 

• X is a finite set; 

• Q : X X X ^ Wis a Q-matrix, i.e., Q{x, y) > for all x / y aiid Yly^x Q{^-> v) — ^ fo^' 
all X ^ X. Throughout this paper we assume that Q is irreducible, i.e., for all x^y ^ X 
there exist n > 1 and xq, . . .Xn & X such that xq = x, Xn = y and Q{xi-i,Xi) > for 
all 1 < i < n; 

• vr is the associated stationary probability measure on X, which exists uniquely by 
basic Markov chain theory. We impose the standing assumption that vr is reversible, 
i.e., we assume that the detailed balance equations 



2. Preliminaries on non-local transportation metrics 



Q{x,y)TT{x) = Q{y,x)TT{y) 
are satisfied for all x, y G Af . 
The set of probability densities with respect to vr will be denoted by 
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and we set 



:= { p G ^{X) I p{x) > Vx E } . 



[0, oo) having the 



Definition 2.1. A weight function is a function : [0, oo) x [0, oo) 

following properties: 

(Al) 9 is continuous on [0, oo) x [0, oo) and on (0, oo) x (0, oo); 
(A2) e{s, t) = 9{t, s) for s, t > 0; 
(A3) e{s,t) >Ofors,t> 0; 

(A4) e{r, s) < e{r, t) for all r > and < s < t; 
(A5) 9 : [0, oo) x [0, oo) — )■ [0,oo) is concave. 

Note that (A5) implies in particular the following estimate: 

9{2s,2t) <29{s,t) \/s,t>0. 

Let us fix a reversible Markov chain (X, Q, vr) and the weight function 9 throughout the 
remainder of this section. We recall the definition of the associated non-local transportation 
metric from [11], as given in a slightly different (but equivalent) form in [8]. Actually, in 
[11, 8] a slightly less general case has been considered, which corresponds to Q{x,x) = —1 
for all X £ X. However, it is easily checked that the results extend to the present setting 
verbatim. 

Definition 2.2 (Non-local transportation metrics). Fix a weight function 9. For po,pi € 
^{X) we define 

\2 



W{po,piy :=inf 



A{pt,ipt) dt : {p,ip) eC£{po,pi] 



where C£{po,pi) denotes the collection of pairs {p,ip) satisfying the following conditions: 
(i) p: [0, 1] ^ M-^ is ; 

(a) po = po , Pi = Pi ; 

{Hi) pt G ^{X) for all t G [0, 1] ; 

(iv) ip : [0, 1] M-^ is measurable ; (2.2) 

(v) For all X £ X and all t G (0, 1) we have 
dtPt{x) + ^ {i^t{y) - 'ilJt{x))pt{x,y)Q{x,y) = , 

and for p G ,^{X) and ?/; G R-^, 

-^(p-V') ■=\ J2 (^*^2/) - p{x,y)Q{x,y)TT{x) . 

x,y£X 

The equation appearing in (2.2) can be regarded as a "discrete continuity equation". The 
analogy with the continuous case becomes more apparent if we introduce some notation. For 
a function ^ : — )■ M, we consider the discrete gradient : x — >• R given by 

VV'(x,y) := i^{y) - V'(x) . 

For a function ^ : X y. X we define its discrete divergence V • ^ : A" — )• R by 

(V • ^){x) := ^ ^i^{x,y) - ^{y,x))Qix,y) G R . 
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With this notation the integration by parts formula 

holds. Here we write, for ip,ip : X ^ R and ^ : A' x A:" — )■ M, 



(«>,^)^ = ^ <^{x,y)^{x,y)K{x,y)TT{x) 



2 

The continuity equation above can now be written as 

dtPt + V • {ptV^t) = . 
Moreover, for p S ^(X) we shall write 

(«>,^)p:=^ Y '^{x,y)'^{x,y)p{x,y)Q{x,y)Tr{x) , \\^\\p := -^(^',^)p . 



With this notation we have 



A{p,^P) := ||V^||2. 



Basic properties of W have been studied in [11]. Under the current assumptions we have 
the following result. 

Proposition 2.3 (Metric structure). The function W : 3^{X^ x ^{X^ — R defines a metric 
on 3^{X). Moreover, (^(Af),W) is a geodesic space, i.e., for all po,pi G ^{X) there exists 
a curve p : [0, 1] — )■ ^{X) satisfying po = po, pi = pi, and 

Wips,pt) = \s-t\W{po,pi) 

for all s,t G [0, 1]. 

Proof. Under somewhat weaker assumptions, it has been proved in [11] that W defines an 
extended, i.e., (possibly [0, oo]-valued) metric. To show that W is finite under the current 
assumptions, we apply [11, Theorem 3.12], which asserts that it suffices to check that 



Cg := / — , dr < 00 . 

Jo y/e{l-r,l+r) 

Using the concavity assumption (A5) we infer that 

0(1 - r, 1 + r) > ^0{2, 0) + ^0{2, 0) 

for r G [— 1, 1], which implies that Cq < 00. 

The final assertion has been proved in [8, Theorem 3.2]. □ 

Some basic properties of the metric W are collected in the following result, which asserts 
that the distance W is induced by a Riemannian metric on the interior 3^^,{X) of ^{X). 

Proposition 2.4 (Riemannian structure). The restriction o/W to ^^{X) is the Riemannian 
distance induced by the following Riemannian structure: 



DISCRETE POROUS MEDIUM EQUATIONS 7 

- the tangent space of p £ can be identified with the set of discrete gradients 

Tp := {V^ : V e 

by means of the following identification: given a smooth curve {—£,e) 3 t pt £ 
with pq = p, there exists a unique element V^o £ Tp, such that the continuity 
equation {2.2) (v) holds at t = 0. 

- The Riemannian metric on Tp is given by the inner product 

{Vip, Vip)p = {'^{x) - 'P{y)){ip{x) - tp{y))p{x, y)Q{x, y)Tr{x) . 

The following result characterises the geodesic equations for W. The equation for is 
reminiscent of the Hamilton-Jacobi equation, which describes geodesies in the Wasserstein 
space. 

Proposition 2.5 (Geodesic equations). Let p £ and -ip € M'^. On a sufficiently small 

time interval around 0, the unique constant speed geodesic with pQ = p and initial tangent 
vector VV'o = satisfies the following equations: 



dtPt{x) + - 'ipt{x))pt{x,y)Q{x,y) = , 



s/e-^ (2 3) 

dtMx) + \Y1 (^*(^) - My))^did{pt{x),pt{y))Q{x,y) = . 

Let A be the discrete Laplacian associated with Q, i.e., for a function ^/^ : A' — t- R we set 

A^(x) := Qix, y){^{y) - Hx)) . (2.4) 

The operator A is the generator of the continuous time Markov semigroup associated with 
Q. Note that we have the usual formula A = (V-)V. By the reversibility assumption, A is 
selfadjoint on L?'{X,tt). 

Let / G C([0, oo); M) be a strictly convex function, which is smooth on (0, oo). We consider 
the associated entropy functional J- : ^{X) — ?■ M defined by 

Hp) ■■= E f(p(^)M^) ■ 

The following result explains the relevance of the non-local transportation metrics W. 

Proposition 2.6 (Gradient flows). The heat flow generated by A is the gradient flow of the 
entropy with respect to W, provided that 6 is given by 

Let us note that among all weight functions 9 of the form (2.5), the logarithmic mean (1.3) 
is the only one satisfying 9{r,r) = r. 

3. Discrete porous medium equations as gradient flows of the entropy 

In this note we shall be concerned with equations of porous medium-type associated with 
a reversible Markov chain {X,Q,tt). The following assumption will be in force throughout 
this section. 
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Assumption 3.1. The function 99 : [0,oo) — )■ M is strictly increasing, C°° on (0,oo), and 
continuous at 0. 

We shall study the discrete porous medium equation 

dtpt = Aifipt) , (3.1) 

where A denotes the discrete Laplacian defined in (2.4). Of course, if (p{r) = r, we recover the 
usual discrete heat equation associated with Q, which has been studied in [5, 11, 14]. We shall 
analyse these equations using gradient flow methods, flrst in a Hilbertian setting of discrete 
Sobolev spaces, and then in a Riemannian setting of non-local transportation metrics. 

3.1. A Hilbertian gradient flow structure. In order to apply Hilbertian gradient flow 
methods we shall introduce a "discrete -space" J^~^. First we observe that, by the 
irreducibility assumption, Ker(A) = lin{l}, where 1 denotes the function identically equal to 
1. Since A is selfadjoint on L'^{X,tt), it follows that the operator A is bijective on 

Ran(A) := ^ip-.X ^ iP{x)7:{x) = o} . 

For V')V'i)V'2 S Ran(A) it thus makes sense to deflne the inner product 

where (•, ■)t^ denotes the L^(Af, 7r)-inner product. The associated norm is given by 

ll^ll^- := VWi^^ = l|VA-Vlk = Jl E (A-V(y)-A-i^(x))'Q(x,y)7r(x) . 

For arbitrary functions ^/^ : — >• M we set = ?/'(x)7r(x) and note that ■0 — c^l belongs 

to Ran(A). A Hilbertian norm on M"^ can then be deflned by 

The equation dtpt = Aip{pt) can now be studied using classical Hilbertian gradient flow 
arguments: 

Proposition 3.2. Let $(r) = ip{s) ds and consider the functional $ : M'^ — ?• M U {+00} 
defined by 

for p G I3^{X), and ^{p) = +00 otherwise. Then the functional $ is strictly convex on ^{X) 
and attains its unique minimum at 1. As a consequence, the following assertions hold for all 
Po e ^(Af).- 

(1) Among all locally absolutely continuous curve {ptjt 'i-n ^{X) with po = po, there exists 
a unique one satisfying the evolution variational inequality 

~\\pt-a\\%.r<^{a)-^{pt) (3.2) 
for all a G ^{X) and a.e. t> 0. 



DISCRETE POROUS MEDIUM EQUATIONS 



9 



(2) For all pQ £ 3^(X) and a.e. t > the curve {pt) satisfies the ordinary differential 
equation 

dtpt = ^^{pt) ■ 

(3) For all po G ^(X) we have pt ^ 1 as t ^ oo. 

Proof. Strict convexity of $ follows from the assumption that is strictly increasing. Part 
(1) follows then from the theory of gradient flows of convex functionals in Hilbert spaces [3, 
Theoreme 3.1] or the more general theory in metric spaces [1, Theorem 4.0.4]. To prove (2), 
it suffices to compute the J^~^ -gradient of To do this, note that for a smooth curve {pt) 
we have 

^*(Pt) = ^(^(Pt),l)7r = {ip{pt),dtpt)n = -{Aip{pt),dtpt)j^--l , 

which shows that the J^~^ -gradient of the functional $ at p G ^(X) is given by — A(^(p), 
as desired. 

Since the gradient of the convex functional $ vanishes at 1, it follows that $ attains 
it minimum at 1. (Alternatively, this follows from Jensen's inequality.) Uniqueness of the 
minimiser follows from the strict convexity of Part (3) is then a consequence of [1, Corollary 
4.0.6]. □ 

3.2. A gradient flow structure for non-local transportation metrics. The gradient 
flow structure described in Proposition 3.2 can be regarded as a discrete analogue of the H~^- 
gradient flow structure of the porous medium equation. Let us now focus on different gradient 
flow structures for the same equation, which are analogous to the Wasserstein-gradient flow 
structure. We refer to [16, Section 2] for a discussion of different gradient flow structures for 
the porous medium equation in the continuous setting. 

Assumption 3.3. The function f : [0, oo) — )• M zs strictly convex, C°° on (0, oo), and 
continuous at 0. Moreover, the function O^j defined by 

^'^^ ' ^ - f'{r)-f'{s) 
for r 7^ s, extends to a weight function on [0, oo) x [0, oo) in the sense of Definition 2.1. 

The associated metric on ^^{X) will be denoted by W;^./. Often, if confusion is unlikely 
to arise, we will simply write W. 

We consider the entropy functional J- : ^{X) — ?• M defined by 

Hp) ■■= E fipi^)X^) ■ (3-3) 

Of course, if /(r) = rlogr we recover the usual relative entropy with respect to vr. 

Proposition 3.4. Let W be any non-local transportation metric. The W-gradient of the 
functional T at p £ ^^{X) is given by V f'{p). 

Here we use the identification of the tangent space provided in Proposition 2.4. 

Proof. Let p G 3^^{X) and pick a smooth curve {pt) satisfying the continuity equation 

dtPt + V • {pt^i^t) = , 
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where we use the notation from Section 2. It then follows that 

^^Hpt) = = -(/'U),v • (ptVVi)). = (v/'u),vvt)p, , 

hence the Wi^j-gradient of the functional at p G is given by Vf^p). □ 

The next result is now an immediate consequence. 

Theorem 3.5. The gradient flow equation of the entropy functional T with respect to the 
metric y^^pj is given by the porous medium equation dtpt = A{p{pt). 

Proof. The gradient flow equation of the smooth functional on is given by 

dtPt + V • {ptVi't) = , V^t = - gradw ^{pt) ■ 

Using Proposition 3.4 and the fact that p'Vf'{p) = 'Vip(p) by the deflnition of 0^, j, we infer 
that 

V • {ptV^lJt) = -V • (ptVf'ipt)) = -A^ipt) , 

hence the gradient flow equation reduces to the porous medium equation dtpt = Aip{pt), as 
desired. □ 

Clearly, the gradient flow structure for the heat equation given in Proposition 2.6 corre- 
sponds to the special case of Theorem 3.5 where ip{r) = r. 

Let us note that also the J^~^-gradient flow structure from Proposition 3.2 is a special case 
of Theorem 3.5. Indeed, if / = then = Moreover, the following result asserts that 
the "^-distance on coincides with the non-local transportation metric Wi, which is 

given by Definition 2.2 in the special case where 9{r, s) = 1 for all r, s > 0. 

Proposition 3.6. For po,pi G we have 

Wpo - Pi||^'-i = WilPcPi) , 
where Wi denotes the discrete transportation metric with 9 = 1. 

Proof. Note that the geodesic equations (2.3) for Wi reduce to 

dtpt = -Atpt , dtipt = . 

Since tp does not depend on t, we infer that the Wi-geodesic between po and pi in ^{X) is 
given by linear interpolation, i.e., pt = {1 — t)po + pi. It follows that Aipt = po — pi for all 
t G [0, 1], and since || • \\p^ = || • ||i we obtain 

Wi(po,Pi)' = dt = IIVA-^PO - Pi)\\l = Wpo - PilL^-i . 

Jo 

□ 

3.3. Porous medium equations and Renyi entropy. Let us now specialise to a partic- 
ularly interesting setting, motivated by the Wasserstein gradient flow structure for porous 
medium equations of the form dtpt = Ap^ in M"" from [16]. 
Let < m < 2 and m 7^ 1, and consider the functions 

^{r) = ^m{r) = , /(r) = /™(r) = -^r^ . 

m — 1 
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In this case the porous medium equation and the entropy functional are given by 

dtpt = Ap^ , T{p) = Fm{p) ■■= pi^r^i^) ' 

m — 1 ^-^ 

thus is the usual Renyi entropy. The weight function is given by 

em{r,s) := V,/„(r,s) := ^ ^m-i _ ,m-i ' (3-4) 
and we let Wm denote the associated non-local transportation metric. Note that 

lim 0m{r,s) = 0i{r,s) , 

m— > 1 

where 9i denotes the logarithmic mean defined in (1.3). The weight function 6m admits the 
remarkable integral representation 

Om{r, s) = 1^ ((1 - a)r'"-i + as""-') ~' da , (3.5) 

which can be checked by an explicit calculation and naturally generalises (1.3). We collect 
here some basic properties of the weight functions associated to the Renyi entropy. 

Lemma 3.7. Let < m < 2. Then the function 6^ defines a weight function with the 
following additional properties: 

(1) (Homogeneity) 9m{\s,Xt) = \6m{s,t) for A > and s,t > 0. 

(2) (Monotonicity in m) For m < m' we have 9mis,t) < 9m'{s,t) for all s,t > 0. 

(3) (Zero at the boundary) 9miO,t) = for all t >0 if and only if m <1. 

Proof. Properties (Al) - (A4) in Definition 2.1 are obvious from either (3.4) or (3.5). 

Concavity of 0.^ follows from (3.5), since an explicit computation shows that the Hessian 
of the function 

F : (r, s) ^ ((1 - a)r™-i + as^-^) ~ 

is given by 

HessF(r,s) = (l-a)a(2-m)((l-a)r™-i+as™-i)'""' ^m-2^m-2 / rs^^ 

which is negative semi-definite for all a G [0, 1]. 

Property (2) follows from the monotonicity in p of L^- "norms", applied on a two-point 
space {0, 1} with probability measure pa '■= (1 — oi)5o + a5i. This monotonicity follows from 
Jensen's inequality and holds for negative p as well. 

Finally, (3) follows from an explicit computation and (1) is obvious. □ 

Let us note that 9m is not a weight function for m > 2, since in this case Minkowski's 
inequality for L'"~^-norms implies that 9m is convex. 

Corollary 3.8. For any reversible Markov chain {X , Q, it) the distance Wm is finite and 
non-increasing in m. More precisely, for any < m <m' <2 and po,Pi £ ^(Af) we have 

yVm'{PO,Pl) < y^'m{PO,Pl) < OO . 

Proof. This is an immediate consequence of the monotonicity of 9m stated in Lemma 3.7 
combined with the equivalent definition of the distance W given in [8, Lemma 2.9]. □ 



12 MATTHIAS ERBAR AND JAN MAAS 

Applied to ip = ipm and f = fm, Theorem 3.5 reduces to the following result. 

Corollary 3.9. Let < m < 2. Then the gradient flow equation of the Renyi entropy Tm 
with respect to the metric Wm is the porous medium equation dtpt = ^pT ■ 

4. Geodesic k-convexity of entropy functionals 

In this section we analyse convexity properties of entropy functionals along geodesies for 
non-local transportation metrics. We fix a reversible Markov chain [X^Q^-k), and we fix 
/ : [0, oo) — >• M satisfying Assumptions 3.1 and 3.3. Throughout this section we set 

Let yy be the associated metric, and let J- be the entropy functional defined in (3.3). 

4.1. The Hessian of entropy functionals. We start by calculating the Hessian of an 
entropy functional J- in the Riemannian structure induced by W. For a probability density 
p € ^■^{X) we introduce the function A^p : A' x A' — t- M defined by 

A^p(x,y) := die{p{x), p{y))l:^ip{p){x) + d2e{p{x), p{y))/\ip{p){y) . 
Furthermore, for a function : A' — )• M we set 

B{p, ^) = \{K^p ■ W, VV>^ - (p • , V{ip'{p)A^))^ 



(4.2) 



= \ J2 (^'(a:^) - V'(y))^Q(a;,y)7r(x) 

x,y,zSX 

X (^die{p{x),p{y)){ip{p{z)) - (^(p(x)))Q(x,z) 

+ d2e{pix),piy)) {^{p{z)) - My)))Q{y. z) 
-\ ^ (i^ix) - '4^{y))p{x,y)Q{x,y)TT{x) 

x,y,z£X 

X (^<f'{p{x))Q{x,z){i^{z) - ^|;{x)) - v'{p{y))Q{y,z){i^{z) - i;{y))) . 

The proof of the following result is an adaptation of [8, Proposition 4.2]. 
Proposition 4.1. For p G ^^:{X) and ip G M*^ we have 

(HessJ-(p)V^ , V^>^ = B(p,V) . 

Proof. Take {pt,'ipt)t satisfying the geodesic equations (2.3). Using the continuity equation 
and (4.1) we obtain 

^T{pt) = -{f'{pt),V-iptVA))^ 

= {vf'{pt),PfVi^t)^ 

= (Vv{pt),Vi^t)- 
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Therefore the second derivative of the entropy is given by 

Using the continuity equation we obtain 

{dtip{pt),Aipt)^ = {if' {pt)dtpu ^iJt) ^ 

= -{V.{pViJt),^'{pt)^^t), 

= {ptV^JuV{^'ipt)A^t))^. 

Furthermore, applying the geodesic equations (2.3) and the detailed balance equations (2.1), 
we infer that 



{A^{pt),dti^t) = -l {Mx)-My))^di9{pt{x),pt{y)) 



2 



X {ip{pt{z)) - ip{pt{x)))Q{x,y)Q{x, z)7r{x) 
\ {Mx) -My))'^(diO{pt{x),pt{y)){f{pt{z)) - 'p{pt{x)))Q{x,z) 



4 



+ d20{ptix),ptiy)){ip{pt{z)) - ip{pt{y)))Q{y, z)\Q{x,y)TT{x 



Combining the latter three identities, we obtain 



2 



(Hess J-(pt)VV't , V^t)p^ = -^^{Pt) 

= -{pf^^t,yAil,t)^ + ]^{k^pt ■ ViJt,ViJt)^, 
which is the desired identity. □ 

Since the Riemannian metric does not necessarily extend continuously to the boundary of 
^{X), it is not obvious that a lower bound At on the Hessian in the interior implies 
geodesic K-convexity in the metric space (^(A'), W). Nevertheless, an Eulerian argument by 
Daneri and Savare [6] can be adapted to the current setting, to show that this is indeed the 
case. We refer to [8, Theorem 4.5] and [13, Proposition 2.1] for the details in the case where 
6 is the logarithmic mean. 

Proposition 4.2. For k G M the following assertions are equivalent: 

(1) For every constant speed geodesic {pt)te[Q.i\ (>^('^))VV) we have 

HPt) < (1 - t)T{p^) + tT{pi) - ^t(l - t)W(po, Pif ■ (4.3) 

(2) For all pQ G ^{X), the solution [pt) to the porous medium equation with po = po 
from Proposition 3.2, satisfies the evolution variational inequality 

^^W(pt,a)2 + '^Wipuaf < T{a) - HPt) (4-4) 
for all a £ ^{X) and a.e. t> 0. 
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(3) For all p G 3^^{X) we have 

HessJ^(p) > K . 

(4) For all p G and ijj gR-^ we have 

B{p,4>) > ^A{p,i;) . 

The evolution variational inequality (4.4) can be regarded as a definition of a (strong) 
notion of a gradient flow in the setting of a metric space. This inequality has been extensively 
studied recently, see for example [1] and [6, Section 3]. 

4.2. Examples. In this section we will study W-geodesic convexity of in some simple 
concrete examples. To simplify notation let us write, for i = 1,2, 

dip{x,y) := die{p{x),p{x)) . 

Set 

Kg := sup{k G M : is K-convex along Wi^j-geodesics} , (4.5) 
with the understanding that sup{0} = — oo. 

4.2.1. The two-point space. We consider the two-point space X = {a, 6} endowed with the 
Q-matrix Q defined by 

Q{a,b)=p, Q{b,a) = q, 
for p,q > 0. In this case the stationary probability measure vr is given by 

7r(a) = — — , 7r(6) 



p + q P + Q 

In this case we have the following characterisation of kq in terms of p and q. 

Proposition 4.3 (Geodesic convexity on the two-point space). Let Q be as above. Then 
KQ = ^ inf |pv^'(r) + q^'is) + 9{r, s) {pf"{r) + qf"{s)) | , 

where r = ^^(1 — a), s = ^^(1 + o.), and the infimum runs over all a G (—1, 1). 
Proof. An explicit computation shows that 

-""■'^>'?T^ /'M°))-/'(P(f)) "''°'"''"'" • 



and 

pq 



B{p,i;) 



p + q 



\ ( qd2p{a, b) - pdip{a, b) ) ( ^{p{a)) - (p{p{b)) 



+ p{a,b)(pip'{p{a))+qip'{p{b)) 



In view of Proposition 4.2, the result follows from these expressions. □ 

The following result provides a simplified expression for kq in the case where vr is symmetric 
and 9 = 9m- The case m = 1 has already been considered in [11, Proposition 2.12]. 
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Corollary 4.4. If p = q and (p = (fm, and f = fm, then 



pm 

ini 



m-1 (1 - a)"" - (1 + q)'' 
2 ae(-i,i) t m (1 - a)™-i - (1 + q) 

+(1-q)™-i + (1 + 

1 a 



_ ^ + (1 + 



,m-2 



pinf, 



ae{-l,l) 



1 — arctanh(a' 



+ 1 



2p, 



< m <2 , m y^l , 
m = 1 . 



Proof. This follows readily from Proposition 4.3. □ 

4.2.2. The discrete circle. As announced in Proposition 1.5, we will exhibit an instance of 
the discrete porous medium equation where convexity fails. For this purpose, we consider 
the discrete circle of length N, i.e., = Tjv = Z/A^Z for some N > 2. All computations 
below are understood modulo N. Let Q denote the discrete Laplacian, normalised so that 
Q{x, y) = q if \x — y\ = 1 and Q{x, y) = otherwise. In this case vr is the uniform probability 
given by Tr(x) = for all x X. 

Proposition 4.5. Let m = 2, let fm, '■Pm o-nd 9m he as in Section 3.3, and let W be the 
associated non-local transportation metric. Let Q be the discrete Laplacian on the discrete 
circle Tjv = Z/AZ with A > 6 defined above. Then the functional J-m is not convex along 
W -geodesies. More precisely, 

qN 

^Q<-—- 

Proof. Since m = 2 we have the simple expressions 

^{r) = f{r) = r\ ^(r, s) = ^ , ai0(r, s) = 520(r, = ^ . (4.6) 
Inserting these into (4.2), we calculate the Hessian of the Renyi entropy as 

2 ^ / 

^) = 4/V ~ V'i+i)^ f phi + Pi+2 + 3/>f + "ipi+i + ^PiPi+i 

i=i ^ 



J^^i'^i- V'i+l)(^i+l - y^pl+l + Pi+l [pi + Pi+2)j ■ 



i=l 

Choosing in particular 

= (Vi,...,Viv) = (0,1,2,..., 2,0) , p={pi,...,pn) = {e,N - {N -l)e,e,...,e) 

we see that only three terms in the first and one term in the second sum are non-zero, and 
we find 

B{p,i,) = -^-^ + 0{e). 

Since A{p, ip) = q + 0{e), the result follows. □ 

Let us remark that in the case where q = A^, the discrete Laplacian converges to the 
continuous Laplacian on the torus T = M/Z. In this limit space, it is well known that the 
Renyi entropy J-2 is convex along 2-Wasserstein geodesies in ^(T). However, the previous 
result shows that the behaviour at the discrete level is very different, since kq < —N^/2. 
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4.3. Consequences of geodesic convexity. In this subsection we will show that convexity 
of the entropy functional along W-geodesics implies a contraction property for solutions to 
the porous medium equation as well as a number of functional inequalities for the invariant 
measure of the Markov chain. These results are in the spirit of the work of Otto-Villani [17]. 
Similar results for the heat flow associated to a finite Markov chain have been obtained in [8]. 

As a first result we single out the following K-contractivity property, which is a direct 
consequence of the evolution variational inequality (4.4). 

Proposition 4.6 (K-Contractivity of the PME). Suppose that kq £M and let {pt)t, if^t)t be 
two solutions of the discrete porous medium equation as given by Proposition 3.2. Then we 
have for all t > 0: 

)^{pt,(7t)<e-^'W{po,ao) . 

Proof. This follows from Proposition 4.2 by applying [6, Proposition 3.1] to the functional 
on the metric space {^{X),W). □ 

We introduce the following functional, which we regard as an analogue of the Fisher infor- 
mation: 

x,y£X 

If / is not differentiable at 0, we use the convention that T[p) = +00 for p ^ l!P^[X). Using 
Proposition 3.4, we see that for p G we have 

l{p) = ||gradw-7="(p)||p • 

The significance of this functional is due to the fact that it gives the change, or dissipation, 
of the entropy functional along solutions of the equation dtpt = A(p{pt), namely: 

^T{pt) = -I{pt) . 

Note that in the setting of Section 3.3, the dissipation functional X,„ associated to <pm and 
fm is given by 

^"^'^^^'■=1^ ^ [p(yr~'-p{xr~'][piyr-Pi^r]Q{x,y)7r{x) . 

x,yeX 

As before, let 1 G ^{X) denotes the density of the stationary distribution, which is 
everywhere equal to 1. It follows from the definition that = /(I)- 
We introduce the following functional inequalities. 

Definition 4.7. The Markov chain {X,Q,tt) satisfies 

(1) an J-WT inequality with constant k G R if for all p G ^{X), 

T{p) - j-(i) < i)^/T{p)- \y^{p. 1)' • (-Fwx(k)) 

(2) an entropy-dissipation inequality with constant A > if for all p G S^i^X'), 

T{p)-T{\)<^X{p) . (EDI(A)) 

The following result relates these inequalities to W-geodesic convexity of the entropy func- 
tional J- . Recall the definition of kq from (4.5). 
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Theorem 4.8. The following assertions hold. 

(1) If kq gM., then Q satisfies FWX{kq). 

(2) If kq > 0, then Q satisfies EDI[kq). 

Proof. The proof follows from similar arguments as the corresponding results in [8] and uses 
the heuristics developed in the continuous case in [17]. However, for the convenience of the 
reader we give a self-contained proof here. 

To prove (1), assume that kq G M. It is sufficient to prove J^>VX(kq) for p G The 
inequality for general p G 0^{X) then follows from an easy approximation argument. So fix 
p G 3^^{X) and let {(t s) s(^[o,i\ be a constant speed geodesic connecting p = uq io 1 = ai. 
Using Proposition 4.2 and rearanging terms in (4.3), we find 

l/x-^_ N x-/, ^^ .^^^u. -,^2 



F{p) - < --[H'ys) - -F(ao) j - ^(1 - s)W{p, ly . (4.7) 
Taking into account the smoothness of {at) near t = 0, we obtain the bound 

^ J^^-^)-^^-^)\ < ||grad^^(.o)IUo lim ^^^^^ = V^W(p, 1) . 

s-s-0 S S 

Hence, the first assertion of the theorem follows by passing to the limit s — )■ in (4.7). 

Let us now prove (2). Assume that kq > 0. From part (1) we know that Q satisfies 
J^yVT[KQ). From this we derive EDI(kq) by an application of Young's inequality: 

xy < cx^ + —y^ Vx, y G M , c > , 
4c 



in which we set x = W(/9, 1), y = \jT{p) and c = □ 

5. GrOMOV-HAUSDORFF CONVERGENCE 

In this final section we discuss the convergence of discrete transportation metrics to the 
Wasserstein metric in a simple setting. 

For A'^ > 2 let = {'L/N'L)'^ be the discrete torus in dimension d > 1, which we regard as 
an approximation of the continuous torus ¥'^=[0,1]'^. We consider the matrix Q : x — )• 
M defined by 

2 



Q{h3) 



-2dN', \i-j\ = l, 
N'^, otherwise. 



Note that the entries are scaled in such a way that the associated discrete Laplacian Ajv 
defined by (1.2) approximates the Laplacian A on T'^. Fix < 9 < 1 and let Wn denote the 
discrete transportation metric corresponding to a weight function O^n. 
The following result has already been announced in the introduction. 

Theorem 5.1. Let d> I and < m < 2. Then the metric spaces ('P(T^), Wat) converge to 
( J^(T'^), W2) in the sense of Gromov-Hausdorff as N ^ 00. 

Since this result can be proved by following the argument in [9], we shall not give a complete 
proof here. Instead, let us point out the three crucial properties oi 9 = 9m which allow us to 
reapply the argument from [9]: 

(1) 9{t, t) = t for all t > 0. 

(2) 9 is concave. 



18 MATTHIAS ERBAR AND JAN MAAS 

(3) For all a, 6 > we have 

1 1 ^ {b-af 1 

e{a,b) 0{a,b) " ab e{a,b) ' 

where 6 denotes the harmonic mean defined by 9{a,b) := 

Assumption (1) is clearly necessary, since it is already checked at the formal level that the 
metrics may converge to a different limit if (1) is violated. The concavity (2) is used in the 
proof in [9] to ensure that the discrete heat semigroup (-P/v(t))t>o contracts the distance: 

WN{PN{t)po,PN{t)pi) < W7v(po,Pl) • 

This estimate is used in regularisation arguments. Finally, the estimate (3) enters the ar- 
gument, since it turns out to be easier to compare the Wasserstein metric to the discrete 
transportation metric Wat, which is defined using the harmonic mean. The estimate (3) can 
then be combined with a regularisation argument, which shows that the difference between 
Wat and Wn becomes negligible as N gets large, provided that the weight function satisfies 
(3). 

Let us note that these properties are satisfied for 6m with < m < 2. Indeed, (1) is 
obvious, and concavity of for < m < 2 has been proved in Lemma 3.7. Moreover, since 
0-1 is the harmonic mean, it follows from the monotonicity of 0m in (proved in Lemma 
3.7) that it suffices to check (3) for m = 2. Li this case an elementary computation shows 
that (3) indeed holds. 
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